Can big data and random forests improve avalanche runout estimation compared to simple linear regression?
نویسندگان
چکیده
Accurate prediction of snow avalanche runout-distances in a deterministic sense remains challenge due to the complexity all physical properties involved. Therefore, many locations including Norway, it has been common practice define runout distance using angle from starting point end zone (α-angle). We use large dataset events Switzerland (N = 18,737) acquired optical satellites calculate α-angle for each avalanche. The α-angles our are normally distributed with mean 33° and standard deviation 6.1°, which provides additional understanding insights into distribution. Using feature importance module Random Forest framework, we found most important topographic parameter predicting be average gradient release area β-point. Despite modern machine learning (ML) method, simple linear regression model yield higher performance than ML attempts. This means that is better an operational context.
منابع مشابه
Random Forests for Big Data
Big Data is one of the major challenges of statistical science and has numerous consequences from algorithmic and theoretical viewpoints. Big Data always involve massive data but they also often include data streams and data heterogeneity. Recently some statistical methods have been adapted to process Big Data, like linear regression models, clustering methods and bootstrapping schemes. Based o...
متن کاملEstimation of suspended sediment concentration and yield using linear models, random forests and quantile regression forests
For sediment yield estimation, intermittent measurements of suspended sediment concentration (SSC) have to be interpolated to derive a continuous sedigraph. Traditionally, sediment rating curves (SRCs) based on univariate linear regression of discharge and SSC (or the logarithms thereof) are used but alternative approaches (e.g. fuzzy logic, artificial neural networks, etc.) exist. This paper p...
متن کاملImplementation of Random Forest Algorithm in Order to Use Big Data to Improve Real-Time Traffic Monitoring and Safety
Nowadays the active traffic management is enabled for better performance due to the nature of the real-time large data in transportation system. With the advancement of large data, monitoring and improving the traffic safety transformed into necessity in the form of actively and appropriately. Per-formance efficiency and traffic safety are considered as an im-portant element in measuring the pe...
متن کاملStrategies to improve neuroreceptor parameter estimation by linear regression analysis.
In an attempt to improve neuroreceptor distribution volume (V) estimates, the authors evaluated three alternative linear methods to Logan graphical analysis (GA): GA using total least squares (TLS), and two multilinear analyses, MA1 and MA2, based on mathematical rearrangement of GA equation and two-tissue compartments, respectively, using simulated and actual PET data of two receptor tracers, ...
متن کاملRobust linear registration of CT images using random regression forests
Global linear registration is a necessary first step for many different tasks in medical image analysis. Comparing longitudinal studies 1 , cross-modality fusion 2 , and many other applications depend heavily on the success of the automatic registration. The robustness and efficiency of this step is crucial as it affects all subsequent operations. Most common techniques cast the linear registra...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Cold Regions Science and Technology
سال: 2023
ISSN: ['0165-232X', '1872-7441']
DOI: https://doi.org/10.1016/j.coldregions.2023.103844